Haoran Xu

SynthVerse: A Large-Scale Diverse Synthetic Dataset for Point Tracking

Feb 04, 2026

Video-OPD: Efficient Post-Training of Multimodal Large Language Models for Temporal Video Grounding via On-Policy Distillation

Feb 03, 2026

Federated Balanced Learning

Jan 20, 2026

Vision Also You Need: Navigating Out-of-Distribution Detection with Multimodal Large Language Model

Jan 20, 2026

Federated Joint Learning for Domain and Class Generalization

Jan 18, 2026

REST: Diffusion-based Real-time End-to-end Streaming Talking Head Generation via ID-Context Caching and Asynchronous Streaming Distillation

Dec 12, 2025

SEDM: Scalable Self-Evolving Distributed Memory for Agents

Sep 11, 2025

Towards Affordance-Aware Robotic Dexterous Grasping with Human-like Priors

Aug 12, 2025

Make Your MoVe: Make Your 3D Contents by Adapting Multi-View Diffusion Models to External Editing

Aug 11, 2025

Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation

Jul 09, 2025